🎮 Reinforcement Learning
RL, Agents, Policy Optimization, Reward Functions
Found-RL: foundation model-enhanced reinforcement learning for autonomous driving
arxiv.org · 12h
💬 LLM
Show HN: Fighting the War Against Expensive Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app · 10h · Discuss: Hacker News
💬 LLM
Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards
arxiv.org · 12h
🔥 PyTorch
check out this article on Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation
dev.to · 2d · Discuss: DEV
💬 LLM
A multi-agent reinforcement learning approach to autonomous aircraft taxiing with taxiing time, fuel consumption, and emission optimization
sciencedirect.com · 1d
🔥 PyTorch
Optimizing post-disaster road restoration with reinforcement learning: A traveler-behavior-aware approach
sciencedirect.com · 1h
🏕️ Survivalism
A training principle for drifting models
breno.bearblog.dev · 6h
🤖 Machine Learning
Observe emergent behavior in autonomous multi-agent LLM networks
agents.glide2.app · 2d · Discuss: Hacker News
💬 LLM
Robotics Motion Learning: Training Linked Robot Arms with Kuramoto Models
hackernoon.com · 1d
🤖 AI
Multi AI Agent Systems with crewAI
deeplearning.ai · 6h
🤖 AI
YORU: Animal behavior detection with object-based approach for real-time closed-loop feedback
science.org · 1d
🤖 AI
Repo Optimizer: I Let a KISS AI Agent Optimize Itself Overnight. It Cut Its Own Cost by 98%.
dev.to · 2h · Discuss: DEV
🤖 AI
How to Leverage Explainable AI for Better Business Decisions
towardsdatascience.com · 2h
🤖 AI
A Conceptual Framework for Exploration Hacking
lesswrong.com · 1h
💬 LLM
GLM-5: From Vibe Coding to Agentic Engineering
simonwillison.net · 22h · Discuss: Hacker News
💬 LLM
Feedback Control for Computer Systems
janert.org · 10h
🤖 AI
The 4 Mixture of Experts Architectures: How to Train 100B Models at 10B Cost
pub.towardsai.net · 4h
🔥 PyTorch
JupyterPS/VBAF: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation
github.com · 2d · Discuss: Hacker News
🤖 AI
A masterclass in AI security operations
redcanary.com · 4h
🤖 AI
Recursive self-improvement from AI models
marginalrevolution.com · 1d · Discuss: Hacker News
🤖 AI